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In the last year 


e Added SSA-based NIR (the New Intermediate Representation) 
e NIR in use by default in 1965/vec4, 1965/fs, vc4, and freedreno 

- Net reduction of 2 backends in 1965 (fs, fo > NIR/fs, vs, vp > NIR/vec4) 
e Mostly stopped working on the tree-based “GLSL IR” 

- 48 optimization patches to NIR 

- 9 optimization patches to GLSL IR 
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In the last year... in the 1965 backend 


e NIR enabled by default 
- Cut 12% of instructions in ARB fragment programs 
e Added pass to combine immediate-value loads 
- Packs 8 values into each register 
- Allows unconditional use of MAD instructions! 
e New conditional-modifier propagation pass 
e Added flag-register dead code elimination 


- Rewrote vec4 dead code elimination pass 
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(New!) shader-db 


e Still a collection of 25k *.shader test files gathered from games and 
benchmarks 


- Plus scripts to eempHe-them-and collect statistics 

e “Runner” script replaced by nice C program using the latest goodness 
- Render nodes, EGL, GBM, libepoxy 
- Single process, uses OpenMP to compile shaders in parallel 


- Feeds compiler stats (instruction counts, loops, spills, etc) back via 
KHR debug 


- 300 second runtime reduced to 90 
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Another year's worth of compiler 


improvements 
| 
| total instructions in shared programs: 6615500 -> 5996928 (-9.35%) | 
| instructions in affected programs: 6165481 -> 5575266 (-9.57%) | 
| GAINED: 236 | 
| LOST: 154 | 
| 


e Previous year was -16.50%, so about -25% in two years 
e Broadwell and newer uses scalar mode for vertex shaders 
e Support for SIMD16 on Gen4 (Improved FPS of Shadowrun Returns by 20%) 


e Support for SIMD16 with control flow added on Gen4 and Gen5 (ILK and 
older) 
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Questions (so far) 
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NIR is here. What to do now? 


e NIRis an additional IR. Hasn't simplified situation overall 
e Reduce or remove optimizations in GLSL IR 


- GLSL IR optimizations are inefficient and slow (and sometimes not even 
effective) 


- Shadertoy.com shader compiles in 25 seconds... or 5 
e GLSL linking in NIR 


- Would allow better pre-linking optimizations 


e Getrid of Mesa IR 
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(965) (Tosi) (vet) Greedreno) (1200) (1915) (swrast) 
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What do we need? 


Software 


e Buy-in from TGSI consumers to go through NIR 


* How they would benefit: 


Share more code, optimizations, lowering passes (gl ClipVertex, GL CLAMP, 
texture rectange scaling, texture projection, integer division) 


Get to delete st glsl to tgsi.cpp and ir to mesa.cpp (NIR > TGSI remaining) 
Better compile times, in some cases significantly 
SPIR-V support 


Maybe better generated code 
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What do we need to do? 


e Port Mesa IR consumers to NIR (1915, r200, swrast) 

e Port Mesa IR producers to NIR (FF vertex pipeline, ARB fp, ARB vp) 
e Port FF fragment pipeline to NIR 

e Fix-up NIR > TGSI translators...? 


e Of course, testing and benchmarking 
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Discussion 
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